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Background of the Invention 

Field of the Invention 

The present invention relates to a method for generating transcriptionally 
active DNA fragments. More specifically, the method relates to synthesis of a DNA 
15 fragment by polymerase chain reaction (PCR) using nested primers, promoter 
sequences and terminator sequences. 

Description of the Related Art 

In addition to the tremendous progress made in the past few years in 

20 sequencing the human genome, efforts have also been made to sequence other 
organisms that are of biomedical importance. For example, complete genomic 
sequences have been obtained for Borrelia burgdorferi (cause of Lyme disease), 
Chlamydia, Heliobacter pylori and Mycobacterium tuberculosis. The fast-growing 
sequence information provides immense opportunities to reveal the basic biology of 

25 related organisms at the gene/molecular level and to develop novel therapeutics or 
vaccines against various pathogens. 

However, this vast sequence information also mandates a much more efficient 
and streamlined way to screen and identify genes of interest from tens of thousands of 
candidate genes. The conventional approach to gene screening and identification 

30 involves generation of a cDNA library, subcloning the DNA inserts into plasmid 



vectors (expression vectors), purifying plasmid DNA from bacteria for each individual 
cDNA clone and transfecting animal cells or tissues for functional analysis of the 
encoded gene product. This method, even in conjunction with the use of polymerase 
chain reaction (PCR) to generate cDNA fragments to allow directional and in-frame 
cloning, is still time consuming, costly and difficult to automate. 

The present invention provides a simple, rapid method for the generation of 
transcriptionally active DNA fragments. 

Summary of the Invention 
One embodiment of the present invention is a method for generating a 
transcriptionally active DNA molecule, comprising polymerase chain reaction (PCR) 
amplification of said DNA molecule in the presence of a first DNA fragment (Fl), 
second DNA fragment (F2), first primer (PI), a second primer (P2), a third primer 
(P3) and a fourth primer (P4) wherein: Fl comprises a promoter sequence; F2 
comprises a terminator sequence; PI is complementary to the 5' end of Fl; P2 is 
complementary to the 3' end of F2; P3 comprises a first region complementary to the 
3' end of Fl and a second region complementary to the 5' end of said DNA molecule; 
P4 comprises a first region complementary to the 5' end of F2 and a second region 
complementary to the 3' end of said DNA molecule, whereby a transcriptionally 
active DNA molecule is produced by said PCR amplification. Preferably, Fl is the 
cytomegalovirus IE promoter. In one aspect of this preferred embodiment, the 
transcriptionally active DNA molecule encodes a therapeutic gene. The method may 
further comprise the step of adding a PNA tail to the 5' -end of PI and P2 prior to the 
PCR amplification. Preferably, a thymidine base immediately precedes the region of 
complementarity between the third primer P3 and the first DNA fragment Fl. In 
another aspect of this preferred embodiment, a thymidine base immediately precedes 
the region of complementarity between the fourth primer P3 and the second DNA 
fragment F2. The method may also further comprise the step of adding a PNA clamp 
to said transcriptionally active DNA molecule after said PCR amplification. 



Preferably, the method further comprises the step of adding a PNA molecule via a 
linker (PNA clamp tail) to primers PI and P2 prior to the PCR amplification. 

Brief Description of the Drawings 
Figure 1 is a schematic diagram of the gene amplification method of the 
present invention. Oligonucleotide primers PI and P2 are complementary to the 5' 
end of a promoter sequence (Fl) and the 3' end of a terminator sequence (F2), 
respectively. P3 and P4 are oligonucleotide primers which contain regions 
complementary to opposite ends of Gene X. In addition, P3 and P4 contain regions 
which are complementary to the 3' region of Fl and the 5' region of F2. An excess 
amount of primers PI and P2 are combined with Gene X, or with a mixture of DNA 
containing Gene X, and with Fl, F2, P3 and P4, or with a mixture of DNA containing 
Fl, F2 and Gene X which has been PCR amplified using P3 and P4, and subjected to 
PCR to produce a transcriptionally active linear DNA fragment containing Fl, F2 and 
gene X. 

Figure 2 is a schematic diagram showing DNA fragment Fl which contains a 
thymidine base immediately preceding the region of complementarity with the PCR 
intermediate (primer P3). 

Figure 3 is a schematic diagram showing the use of peptide nucleic acid (PNA) 
sequences and PNA clamps to protect the ends of linear expression DNA fragments. 
Primers PI and P2 contain a PNA tail which is resistant to proteolytic and 
exonuclease degradation. The resulting linear expression DNA fragment contains a 
PNA tail at each of the 5 '-ends. The 3' ends are protected by addition of a PNA 
"clamp" which is also resistant to proteolytic and exonuclease degradation. 

Figure 4 is a schematic diagram showing the use of a PNA "clamp" tail to 
protect the 5' ends of the linear expression DNA fragment, followed by formation of a 
clamp during PCR to protect the 3' ends. 

Detailed Description of the Preferred Embodiments 
The present invention provides a simple, efficient method for generating 
transcriptionally active DNA fragments which can be readily transfected into animal 



cells or tissues by conventional nucleic acid transfection techniques, without the need 
for subcloning into expression vectors and purification of plasmid DNA from bacteria. 
The transcriptionally active DNA fragments are synthesized by polymerase chain 
reaction (PCR) amplification of any gene of interest using nested oligonucleotide 
primers and two DNA fragments, one of which comprises an active transcription 
promoter sequence and one of which comprises a basic transcription terminator 
element. A first primer is complementary to the DNA fragment comprising the 
promoter; a second primer is complementary to the DNA fragment comprising the 
terminator; a third primer is complementary to both the promoter sequence and one 
end of the gene of interest; and a fourth primer is complementary to both the 
terminator sequence and the other end of the gene of interest. These promoters, genes 
and terminators are linked in an expression cassette as shown in Figure 1. In addition, 
the ends of linear DNA containing the expression cassette can be protected from 
exonuclease digestion during and after transfection by incorporating peptide nucleic 
acid (PNA) sequences and PNA clamps as shown in Figures 3 and 4. 

As used herein, the term "promoter" is a DNA sequence which extends 
upstream from the transcription initiation site and is involved in binding of RNA 
polymerase. The promoter may contain several short (<10 base pair) sequence 
elements that bind transcription factors, generally dispersed over >200 base pairs. A 
promoter that contains only elements recognized by general and upstream factors is 
usually transcribed in any cell type. Such promoters may be responsible for 
expression of cellular genes that are constitutively expressed (sometimes called 
housekeeping genes). There are also tissue-specific promoters limited to particular 
cell types, such as the human metallothionein (MT) promoter which is upregulated by 
heavy metal ions and glucocorticoids. 

As used herein, the term "terminator" is a DNA sequence represented at the 
end of the transcript that causes RNA polymerase to terminate transcription. This 
occurs at a discrete site downstream of the mature 3' end which is generated by 
cleavage and polyadenylation. 



The present method can be used to quickly generate nuclease-resistant and 
transcriptionally active linear DNA molecules which express any desired gene with 
any known sequence. The linear DNA can then be delivered into animal cells or 
tissues for functional analysis, vaccination and other pharmaceutical applications. 
This method also avoids problems associated with bacterial growth such as toxicity 
and stability. This method can also be completely automated for use in high- 
throughput screening methods. 

Referring now to Figure 1, oligonucleotide primers PI and P2 are 
complementary to the 5' region of DNA fragments Fl and F2, respectively. Fragment 
Fl comprises a transcription promoter (darkened region) and fragment F2 comprises a 
transcription terminator (darkened region). Primers P3 and P4 are complementary to 
the 3' ends of Fl and F2, respectively, and to different ends of the DNA fragment 
containing Gene X. Figure 1 shows the putative intermediates and the final product 
which are formed in the present PCR-based method when PI, P2, P3, P4 , Fl and F2 
are added to a DNA template comprising Gene X. It should be noted that amount of 
primers PI and P2 present in the reaction mixture are preferably at least about 100 
fold greater than the amounts of P3, P4, Fl and F2, to minimized the amounts of 
intermediate products generated during the reaction. To generate the first 
intermediate, primer P3 and P4 are used which amplify from the promoter and 
terminator sequences of Fl and F2 across Gene X, resulting in an intermediate having 
the promoter and terminator sequences on the ends and Gene X in between these 
sequences. 

The second intermediate is formed by hybridization of the first intermediate 
with fragments Fl and F2 via their complementary promoter and terminator sequences 
(darkened regions), followed by PCR amplification from primers P3 and P4, resulting 
in a DNA segment comprising the entire Fl, F2 and Gene X. 

In the last step of the reaction, PCR amplification of the second intermediate 
using primers PI and P2 results in amplified amounts of the complete transcriptionally 
active DNA fragment. 



The method of the invention may be performed by adding all of the 
components in a single reaction mixture. Alternatively, two separate PCR reactions 
can be performed. In the first reaction, the template gene, P3 and P4 are used first. 
The product of this reaction is then used as a template for a second PCR reaction 
5 involving fragments Fl and F2, plus primers PI and P2. 

The generation of final products using the nested PCR method describe above 
is dependent on the sequences of the Fl or F2 fragments at the junction between the 
region overlapping P3 and P4, respectively. This is due, at least in part, to the 
addition of an extra adenosine base (A) at the 3' end of the PCR fragment by Taq 
10 DNA polymerase which is commonly used in PCR protocols. This could produce a 
mismatch between the PCR intermediate generated by P3/P4 and fragments Fl and 
F2. 

Thus, in a preferred embodiment, the overlap between the PCR intermediate 
(P3 primer) and Fl, and P4/F2, is designed such that a thymidine base (T) 

15 immediately precedes the overlap region in fragments Fl and F2 (Fig. 2, only Fl 
shown). In this case, even with the addition of an "A" base to the 3' end to the 
intermediate during PCR amplification, the overlap between F1/F2 and the respective 
PCR intermediates is still perfectly maintained. 

Peptide nucleic acids (PNA) are nucleic acid analogs in which the entire 

20 deoxyribose-phosphate backbone has been exchanged with a chemically completely 
different, but structurally homologous, polyamide (peptide) backbone containing 2- 
aminoethyl glycine units. Unlike DNA, which is highly negatively charged, the PNA 
backbone is neutral. Therefore, there is much less repulsive energy between 
complementary strands in a PNA-DNA hybrid than the comparable DNA-DNA 

25 hybrid, and consequently they are much more stable. 




addition, molecules called PNA "clamps" have been synthesized which 



have two Identical PNA sequences joined by a flexible hairpin linker containing three 
8-amino-3,6^dioxaoctanoic acid units. When a PNA clamp is mixed with a 
complementaryshomopurine or homopyrimidine DNA target sequence, a PNA-DNA- 
30 PNA triplex hybrid can form which is extremely stable (Bentin et al., Biochemistry 



* * 

^8863-8869, 1996; Egholm et al., Nucleic Acids Res. 23:217-222, 1995; Griffith et 
al.,\^m. Chem. Soc. 117:831-832, 1995). The sequence-specific and high affinity 
duplex ai^d triplex binding of PNA have been extensively described (Nielsen et al., 
Science 254SU97-1500, 1991; Egholm et al, J. Am. Chem. Soc. 114:9677-9678, 
1992; Egholm et^l., Nature 365:566-568, 1993; Almarsson et al, Proc. Natl Acad. 
Sci. U. S. A. 90:95^9546, 1993; Demidov et al., Proc. Natl Acad. Sci. U S A. 
92:2637-2641, 1995). i^ey have also been shown to be resistant to nuclease and 
protease digestion (Demidov^ al., Biochem. Pharm. 48:1010-1013, 1994). 

A representative PNA clamp is shown in Figs. 3 and 4. This PNA clamp, or a 
PNA clamp having any desired DNA sequence, is synthesized as described by 
Egholm et al. (supra.) or is available from PerSeptive Biosystems (Framingham, MA). 
The representative PNA clamp shown in Figs. 2 and 3 has the sequence 5'- 
TCTCTCTC-O-O-O-TJTJTJTJ (SEQ ID NO: 1), where J (pseudoisocytosine) is an 
analog of C, and the "0" residues are 8-amino-3,6-dioxaoctanoic acid linkers which 
separate the two regions of the PNA. 

The use of PNA tails, PNA clamps and PNA "clamp tails" to protect the ends 
of the transcriptionally active PCR fragments which are described above is illustrated 
in Figure 2. The ends of the PCR fragments generated as shown in Figure 1 are prone 
to digestion by exonucleases after transfection into a cell or tissue. Two strategies for 
inhibiting this degradation are shown in Figures 3 and 4. In Figure 1, a PNA tail and a 
short DNA sequence (5'-GATC-0-TCTCTCTC-3'; SEQ ID NO: 2) is added to the 5' 
end of primers PI and P2 which bind to the target sequence 5 ' -GAG AGAGA-3 ' (SEQ 
ID NO: 3). This generates a PNA binding site at the 3' end of the opposite strand 
during PCR. Primers containing PNA tails can be synthesized using methods well 
known in the art or are available from PerSeptive Biosystems. These PNA tails do not 
hybridize to a target sequence, but protect the 5' -ends from exonuclease digestion. In 
order to protect the 5' ends of the PCR fragment, a PNA molecule is added to the 3'- 
PNA tail-protected PCR fragment. This PNA contains a DNA sequence which is 
complementary to the 3' -end of the PCR fragment and binds thereto as shown in Fig. 



An alternate PNA protection approach is shown in Fig. 4. In this method, a 
PNA clamp is linked to primers PI and P2 via one or more 8-amino-3,6-dioxaoctanoic 
acid linkers to form a PNA "clamp" tail by well known methods. After PCR 
amplification, the "clamp tail" binds to the 3' ends of the PCR fragment, and is linked 
to the 5' ends via the one or more 8-amino-3,6-dioxaoctanoic acid linkers to protect 
both the 3' and 5' ends of the PCR fragment. 

Although the use of the CMV IE promoter and an artificial mammalian 
transcriptional terminator elements are exemplified herein, the use of any eukaryotic 
promoter and terminator is within the scope of the present invention. Suitable 
promoters for use in the present invention include, for example, SV40, rous sarcoma 
virus (RSV), retroviral long terminal repeats (LTR), muscle creatine kinase promoter, 
actin promoter, elongation factor promoter, synthetic promoters, tissue-specific 
promoters, and the like. Suitable terminator sequences include SV40 transcription 
terminator, bovine growth hormone (BGH) terminator, synthetic terminators, and the 
like. These promoter and terminator sequences can be obtained by restriction enzyme 
digestion of commercially available plasmids and cDNA molecules, or can be 
synthesized using an automated DNA synthesizer using methods well known in the 
art. 

Any desired gene may be amplified and coupled to active promoter and 
terminator sequences using the method of the present invention. In a preferred 
embodiment, the gene encodes a gene product which is absent or present at reduced 
levels in an organism. Nonlimiting examples of these gene products are the cystic 
fibrosis transmembrane regulator (CFTR), insulin, dystrophin, interleukin-2, 
interleukin-12, erythropoietin, gamma interferon, and granulocyte macrophage colony 
stimulating factor (GM-CSF). 

Although any transfection method well known in the art may be used to 
transfect the transcriptionally active PCR fragments of the invention into cells or* 
tissues including calcium phosphate precipitation, electroporation and DEAE-dextran, 
cationic lipid-mediated transfection is preferred. Gene delivery systems are described 
by Feigner et al. {Hum. Gene Ther. 8:511-512, 1997) and include cationic lipid-based 



delivery systems (lipoplex), polycation-based delivery systems (polyplex) and a 
combination thereof (Hpopolyplex). Cationic lipids are described in U. S. Patent Nos. 



4,897,355 and 5,459,127, the entire contents of which are hereby incorporated by 



The following components are combined in a 100 jal reaction volume: DNA 
fragment Fl (4 ng) which comprises a region of high transcriptional potency (-240 to 
+60) from the human cytomegalovirus (CMV) immediate early gene (IE) 
promoter/enhancer, DNA fragment F2 (4 ng), a 55 base pair oligonucleotide encoding 
an artificial mammalian transcription terminator element (5'- 
CACAAAAAACCAACACACAGATCTCTAGAGCTCTGATCTTTTATTAGCCA 
GAAGT-3'; SEQ ID NO: 4), 400 ng primer PI (5'- 
TCTCTCTACGTATTAGTC ATCG-3 ' ; SEQ ID NO: 5), 400 ng primer P2 (5'- 
TC AC AAAAAACC AAC AC AC AG-3 ' ; SEQ ID NO: 6), 4 ng primer P3 and 4 ng 
primer P4. The PI and P2 primer sequences correspond to the 5' end of fragment Fl 
and the 5' end of fragment F2, respectively (Fig. 1). Primers P3 and P4 are designed 
based on the gene sequence of interest. The 3' potion of these primers (about 10-20 
base pairs) is determined by the actual sequence of the gene to be amplified, whereas 
the 5' region of P3 comprises the following sequence: 5'-CTCCGCGGATCCAGA-3' 
(SEQ ID NO: 7) to overlap with the 3' region of Fl, and P4 comprises the sequence 
5 '-TTATTAGCCAGAAGT-3 ' (SEQ ID NO: 8) to overlap with the 3' region of F2. 

PCR is performed as follows: denaturation for 30 seconds at 94°C, annealing 
for 45 seconds at 55°C and extension for three minutes at 72°C for 25-30 cycles. The 
size of the final produce is verified by 1% agarose gel electrophoresis. The amplified 
PCR fragment is cleaned and purified using a commercial PCR cleaning kit (e.g., 
Qiagen), and can be used for in vitro or in vivo transfection of cells or tissues. 



The green fluorescent protein (GFP, 700 bp) was used as a target gene. The Fl 
fragment was a 600 bp fragment of the CMV immediate early gene promoter (from - 




reference. 



Example 1 

Production of transcriptionally active PCR fragments 



Example 2 



550 to +50). A 50 bp oligonucleotide containing a modified rabbit beta-globin gene 
transcription terminator was used as primer 2. Two different versions of these primers 
were designed so that the overlap between Fl and the intermediate GFP PCR 
fragment was either preceded with or without a thymidine base. After the PCR 
reaction was carried out according to the conditions described in Example 1, the 
products were analyzed by electrophoresis. When a thymidine base preceded the 
overlap region in Fl, a clean major PCR fragment of about 1.4 kb was produced, 
representing the GFP coding region flanked by CMV promoter (Fl) and the 
transcription terminator, whereas if a base other than thymidine preceded the 
overlapping region, no product was generated. The fragment generated by nested 
PCR was then transfected into Cos-7 cells and the expression of GFP was monitored 
by fluorescence microscopy. The results showed that the intensity and frequency of 
GFP-expressing cells was almost the same between cells transfected with the PCR 
fragment and a supercoiled plasmid DNA in which the same CMV promoter was 
driving expression of the GFP gene. Thus, the present method produces DNA 
fragments which are transcriptionally active. 

While particular embodiments of the invention have been described in detail, it 
will be apparent to those skilled in the art that these embodiments are exemplary 
rather than limiting, and the true scope of the invention is that defined in the following 
claims. 
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